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Abstract 

We consider two classes of random graphs: 

(a) Poissonian random graphs in which the n vertices in the graph 
have i.i.d. weights distributed as X, where E(X) = /j,. Edges are 
added according to a product measure and the probability that a 
vertex of weight x shares and edge with a vertex of weight y is given 
by i — e - x y/(M n ) . 

(b) A thinned configuration model in which we create a ground-graph 
in which the n vertices have i.i.d. ground-degrees, distributed as D, 
with E(D) = fi. The graph of interest is obtained by deleting edges 
independently with probability 1 — p. 

In both models the fraction of vertices in the largest connected 
component converges in probability to a constant 1 — q, where q de- 
pends on X or D and p. 

We investigate for which distributions X and D with given \i and 
p, 1 — q is maximized. We show that in the class of Poissonian random 
graphs, X should have all its mass at and one other real, which 
can be explicitly determined. For the thinned configuration model D 
should have all its mass at and two subsequent positive integers. 



1 Introduction 



In this paper we study asymptotic properties of some random graphs as n, 
the number of nodes/vertices, tends to infinity. More specifically, we study 
the size of the largest connected component, within two classes of random 
graphs. If this largest connected component is of the same order as the 
number of nodes, then it is called the giant. We find the random graph 
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that has the largest giant among all random graphs in the class having a 
pre-defined mean degree. 

We consider two types of networks in this paper. Both types of random 
graphs are frequently used extensions of the classical Erdos-Renyi graph [3j 16] . 

• Poissonian random graphs: Poissonian random graphs were intro- 
duced in [10J and are a main example of inhomogeneous random graphs 
jl]. Our model is slightly different from the model introduced in [lOj . 
but asymptotically (for n — > oo) the fraction of vertices in the largest 
connected component of the graph will be the same. 

We construct the random graph of n vertices as follows. First we assign 
independent and identically distributed (i.i.d.) weights to the vertices 
distributed as the non-negative random variable X, with nx '■= E(X) < 
oo. A pair of vertices with weights x and y share an edge with prob- 
ability 1 — exp[— xy/(nfj,x)}, independent of other edges in the graph. 
In [TU] this probability is 1 — exp[— xy/L n ], where L n is the sum of the 
n vertex weights. 

We note that creating the graph and after that removing edges inde- 
pendently with probability 1 — p, is asymptotically in distribution the 
same as immediately creating a Poissonian random graph with weight 
distribution pX. Later we will be interested in properties of thinned 
versions of Poissonian random graphs, but this observation implies that 
these fall under the same model and hence need no additional analysis. 

• Thinned configuration model: The configuration model [HI IE] is 
obtained by assigning i.i.d. numbers (distributed as the non- negative 
integer valued random variable D) of half edges to the n vertices in the 
graph. We assume /id = K(D) < oo. If the total number of half-edges 
is odd, then we add one half-edge to the final vertex. Then, we pair the 
vertices uniformly at random. The probability that a specific graph is 
created, is the same for all graphs with a given degree sequence. Parallel 
edges (edges with the same end vertices) and self-loops (an edge which 
connect a vertex to itself), might occur, but they will not influence 
the asymptotic fraction of vertices in the largest connected component 
(when [i D < oo as we have assumed), ([S]). We clean the graph, by 
removing all self loops and merge all parallel edges. It is easy to check 
that this will not influence the asymptotic degree distribution for /id < 
oo. After this, edges are deleted independently with probability 1 — p. 

Within both of the two random graph models we identify the distribution, 
X and D respectively, which maximizes the size of the giant among all distri- 
bution having some fixed mean fix and fir, and p respectively. The problem 
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of identifying which (random) graph has the maximal giant among all graphs 
with a fixed mean degree /xp is less interesting in that the giant can make up 
the whole population whenever //d > 2, while for \xd < 2, the maximum is 
obtained in any graph, which is a tree. Similarly the problem of minimizing 
the size of the giant among Poissonian random graphs and thinned configu- 
ration models is achieved by chosing P(X = 0) (resp. F(D = 0)) arbitrary 
close to 1, which has the effect that the relative size of the giant goes to zero. 

Random graphs are interesting in their own right but also have numerous 
applications. One such application, which has been the inspiration to many 
scientists, is that of modeling the spread of an infectious disease in a socially 
organized human community, where the social structure of the community 
is described by a random graph [Tj [21 El E] ■ The interpretation of thinning 
the random network when considering the spread of SIR (Susceptible — >■ 
Infectious — > Recovered) epidemics with non-random infectious periods on 
networks is that transmission will take place (if one of the node gets infected) 
exactly along those edges that are kept, and transmission between one in- 
fected node and a susceptible neighbor is modeled to have probability p and 
being independent of other transmission links. In epidemic terminology our 
aim is hence to identify the degree distribution with given mean for which 
the asymptotic fraction of individuals infected, in case of a major epidemic 
outbreak, is maximal. 

2 Notation and some basic results 

Throughout, we will use N for the strictly positive integers and No = N U 
for the non-negative integers. Unless specified otherwise we will use D for 
a random variable taking values in No and X for a non-negative real valued 
random variable. The mean of X is denoted by fix- A bar above a random 
variable denotes the size biased variant of the random variable, i.e. 



The (probability) generating function of an No valued random variable is 
defined by f D (s) := E[s D ] = ££l s fc P(.D = k), for s G [0, 1]._ We sometimes 
use the notation /d(s) = fo-ii 3 )- The smallest root of s = /d(s) is denoted 
by zd and qn is defined by qr> = /d(^d). Some well-known facts about 
generating functions that we will use (and which are easy to check) are: 




or in case of a general positive random variable 
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1- /z?( s ) is analytic on (0, 1) and all derivatives of Jd{s) are non-negative. 

2- /d(1) = 1. 

3 - £fD( s ) = ^dId(s), in particular £f D (s)\ s=1 = \i D . Or equivalently: 

f D (s) = 1 - fi D J f D (x)dx (1) 

for s E [0,1]. 

4. zd is the extinction probability of a Galton Watson branching process 
[7] with offspring distribution D — 1 and one ancestor. is the ex- 
tinction probability of a branching process for which the number of 
children of the ancestor is distributed as D and all other individuals 
have offspring distribution D — 1. 

5. qo and are strictly less than 1 if and only if E(D — 1) > 1. 

For a non- negative real valued random variable X, the distribution of a 
mixed Poisson(X) random variable D is given by F(D = k) = E(^-e^ x ). 
The generating function of this random variable D is given by Jd{s) = 
E(e~( 1-S ) x ). Furthermore, /d(-s) = E(e _< - 1_s - )X ). So the generating function 
of D — 1 is given by the generating function of a mixed Poisson distribution 
based on the size biased variant of X. We note that fin = Hx- 

In this paper, we consider undirected simple graphs. A simple graph is a 
graph with no parallel edges (two or more edges with the same end- vertices) 
or self-loops (edges connecting a vertex to itself). The degree of a vertex is 
the number of edges a vertex is adjacent to. 

A connected component in a graph is a set of vertices for which there is a 
path between every pair of vertices in this set. Let C l {n) be the z-th largest 
connected component (in case of a tie, the order of the tied components is 
uniform at random). The number of vertices in a set S is denoted by |«S|. 

We consider two types of random graphs (as defined in the introduction): 

• Poissonian random graphs: For this model there is, for n — > oo, 
with high probability, at most one giant component jl], i.e., for every 
e > we have, 

lim F(\C 2 (n)\ < en) = 1. 

n— >oo 

Let D be mixed Poisson (X). The fraction of vertices in the giant 
component is for large n, with high probability, close to 1 — qo- More 
precise, for every e > 0, we have 

lim PflrT^C^n)! - (1 - q D )\ < e) = 1. 
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• Thinned configuration model: Let D be the degree distribution of 
the ground-graph and p the thinning parameter. For technical reasons 
(see the remark below) we exclude the model in which both p = 1 
and P(.D = 0) + ¥(D = 2) = 1. For the thinned configuration model 
we also have that the probability that there is more than one giant 
components converges to as n — > oo. In this class of random graphs, 
the generating function of the degree distribution of the thinned graph 
is given by g D , P (s) = f D (l -p + ps) and g D , P { s ) = Jd{^ -p + ps). The 
mean degree of a randomly chosen vertex is pfiD- 

Let zd, p be the smallest root of s = g~D,p{s) and define qo, p = gD,p{zD,p)- 
The fraction of vertices in the giant component is for large n, with high 
probability close to 1 — qo,p- More precise, for every e > 0, we have 

lim PflrT^HI - (1 - q D , p )\ < e) = 1. 

n— >oo 

Remark: If p = 1 and F(D = 0) + P(L> = 2) = 1, then there exists e > 
such that lim n ^oo P(|C 2 (n)| > en) > 0. In this model the fraction of 
the vertices in a component of size at least k converges to F(D = 2) for 
every k G N. However, the fraction of vertices in the largest component 
does not converge to qo, P = P(-D = 2). 

We will show that, for the Poissonian random graphs with given fj, x , the 
limiting size of the giant component is maximized if all vertices have weight 
fix whenever fix > He ~ 1-756. If fi x < fi c , then X should only have mass 
on fi c and 0. Again we note that thinning with a factor p is equivalent to 
replacing X by pX. 

For the thinned configuration model with given fi D and p, the maximal 
giant size is obtained if D has all mass on and two subsequent positive 
integers k and k + 1. We were not able to identify a closed formula for k, 
and the exact mass distribution on the three possible atoms. 

3 Poissonian random graph 

Define fi c as the largest root of 2x = e x ~ x l 2 . The numerical value of fi c is 
approximated by fi c ~ 1.756. 

Let \C x (n)\ be the size of the giant in the Poissonian random graph with 
weight distribution X and n vertices. Again D is mixed Poisson(X). Fur- 
thermore, let qo and zd be as before. Let be the collection of mixed 
Poisson random variables with E(D) = fi. 
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Theorem 3.1 Let D* be mixed Poisson(X* ) and 

P(X* = max(/i,/i c )) = 1 - P(X* = 0) = min(l, /x//x c ), 

then 

min q D = q D * . 

This theorem may be interpreted as follows. If fi < fi c , then D £ T? M 
defined via 

P(X = n B ) = 1 - P(X = 0) = /i//i c 

leads to the Poissonian graph, for which the fraction of vertices in the giant 
converges in probability (as n — > oo) to the largest limit. If /i > /x c , then 
P(X = ji) = 1 (i.e. the Erdos-Renyi graph [3] with mean degree fi) leads to 
the largest giant in this class. 

We first show by a series of three lemmas that the maximal giant is 
obtained for a weight distribution with mass only at and one other real 
number. After that we show that in this class, X* leads to the largest giant 
component, which will complete the proof. 

Lemma 3.2 If A and B are positive real valued random variables, then 

OT>E( B )(^)\ 

Proof: Use H61ders inequality, E[XY) < E[(X a ) 1 / a ] + E[(Y" 6 ) 1/fe ]> for non- 
negative random variables X and Y and a, b > such that a -1 + b^ 1 = 1. 
Filling in a = k, X = AB x l k and Y = _B( fc_1 )/ fc gives the desired result. □ 

Lemma 3.3 Let X be a general non negative random variable and D be 
mixed Poisson(X). If for some s* £ (0, 1), 

f D (s.) :=E(e" x ^)=e- x ^\ 

then E(e~ x ( 1 - S )) < e - ^ 1- ') for s £ [s„ 1]. 

Proof: If /d(s) crosses e _A< - 1_s - ) from below in s*, then we know that 

^f D (s)\ s=s , >Ae-^— ). (2) 
Furthermore, for /c £ N, we have 

^M*)U* = > aV^ 1 -) = |^- A(1 - S) u* . 
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Here the inequality follows by (J2J) and Lemma 13.21 with A = X and B = 
e -x(i-s„)_ gj nce / D (l) = 1 ; and /d(s) is analytic on (0, 1), a Taylor expansion 
in s = s* gives that /d(s) cannot cross e _A( - 1 ~ s - ) from below in s G (0, 1). □ 
Let D := -D(A) be mixed Poisson(X), where X = X(X) is defined by 
P(X = A) = 1 - ¥(X = 0) = /i/A. Let 

/(s; A) := = l-fi/\+ (fi/X)e^ 1 - s \ 

Note that /d( s ) = e~ x ^~ s \ is the generating function of a Poisson A distri- 
bution. Let q(X) := Qd(\) and z(A) := -2d(A)- 

Lemma 3.4 Let X be a general non negative random variable with mean 
fix and let D be mixed Poisson(X). 

(a) If z D > z(fjL X ) = q(fJ'x), then q D > q(fi x )- 

(b) If z D < q(nx), then q D > q(X), where X = - \og[z D ]/(l - z D ). 

Proof: (a) follows from, qp = e ~ x ( 1 - z D)^ Jensen's inequality and the fact 
that /d(s) is increasing, (b) follows from Lemma I3T31 and the fact that 
equals e~ A ^ 1_s ^ = f(s, A) in zp and 1. Using ([TJ completes the proof. □ 

We now show that among the distributions (_D(A);A > 0), the fraction 
of vertices in the giant component will converge in probability to the largest 
limit for D*. 

Lemma 3.5 Let T>'^ be the class of mixed Poisson random variables, where 
D is mixed Poisson(X ) and F(X = A) = 1 — ¥(X = 0) = fi/X, where X > fi. 
For D e T>' . qr> is minimized for X = max(/x, fi c ). 

Proof: We first note that we might assume that A > 1, otherwise qp = 1 
anyway. In what follows we need that z(X) is different iable on (l,oo). We 
prove this by analyzing the derivative of its inverse z~ 1 (x), and show that 
it is non- zero on this domain: From the definition of z(X), we deduce that 
z-i( x ) = zgp. Then 

d i , s 1 — x + x logbl 

t z ( x > = r-i — \~2 — 

ax x(l — xy 

For x G (0, 1), this derivative is strictly negative and finite, by 

-r-\— (1 — x + xloglx])] = — log[x] > 0, 
ax 

and [1 — x + x\og[x]] x=1 = 0. This implies that z(X) is differentiable on the 
domain where it takes values in (0, 1), that is on (1, oo). 
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Since f(z(X); A) — z(X) = 0. We obtain by applying the chain-rule 



0=^[/MA);A)-z(A)] 



dX 



z(X) 



J s=z(X) 



This gives that 



Furthermore, 



d 

dX 



z(X) 



d 



?(A) 



/C*(A),A) 



-^/(«;A) + -^/(a;A)4*(A) 
aA as aA 



dA dA 

Noting that jj/(s; A)| S=2(A ) = /if(z(X)] A) = /iz(A), we get 



S=z(A) 



^(A)j/(»;A) d 



s=z(A) 



Equating this derivative to and using f(s; A) = 1 — /x/A + (w/A)e 
and /(s; A) = e 



-A(l-s) 



-A(l-s) 



gives: 







"^ (A)]2[1 " ^ (A)] + £[1 - *(A)] - £[1 - a(A)]s(A). 

A 



Xz(X) 



A 1 



The solutions of this equation are z(X) = 1 and 21(A) = (2A) 1 . The first 
solution is of no use, because if z(X) = 1, then q(X) = 1 as well. Filling 



in the second solution in z(X) = e 



-A(l-z(A)) 



gives 



2 A = e A_1/2 . Because the 



root of this equality is strictly larger than 1 and because lim^oo q(X) = 1, 
g(A) takes its minimum on (1, 00) in this largest root. The lemma follows by 
observing that A > fi. □ 



Proof of Theorem I3.lt From Lemma 13.41 it follows that for any mixed 

at 
□ 



Poisson distribution D G T>^ there is a distribution D' G T>' such that 



Qd' < Qd- The theorem now follows from Lemma [3.51 



4 Thinned configuration model 

Let £> M be the collection of all No valued random variables D with E(D) = //. 
Let Zd, p be the smallest root of the equation s = gD,p(s) and qo, p = 9d,p{.zd, p )- 
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Theorem 4.1 Let q* = mi D£B ^q Dp , then there exist k 6 N and a degree 
distribution D* e <6(/x), which satisfies 

W>(D* = 0) + P(L>* = Jfe) + P(L>* = Jfe + 1) = 1, 

such that qn*,p — I*- 

Proof: Let q := qn tP and z := Zd >p . First we show that we always can find 
a distribution D' with mass only at and 2 subsequent integers, such that 
the associated q' = qo', P satisfies q > q'. 

Let q" = qD", P and z" = zd», p , where the degree distribution D" is defined 
by ¥(D" = [/ijj = 1 - p+ L/lj = 1 - F(D" = + 1). The generating 
functions associated with the model before thinning are f"(s) and f"{s). We 
now distinguish two cases. 

• If z > z", let k = L/iJ and (3 = // — k. We rewrite D as a mixture of 
two distributions D\ and D 2) with E(Z?i) = and E(_D 2 ) = fc + 1. So 
.D is with probability 1 — and D 2 with probability /3. Then 



where the inequality is obtained by Jensen's inequality. By z > z" and 
the observation that /d"(s) is increasing we note that 



and the statement holds for z > z". 

• If z < z", then we may chose an integer k and a random variable D* 
with mass only at 0, k and k + 1, such that ~E(D) = \i and z^\ p =: z* = 
z. Let P(s) := /ot(s) be its generating function and /^(s) := /ot(s). 
Let a := P(Z)' — 1 = fc). As is done in the previous section, we use 
that by CQ) 



Q = /(! ~P + P^) 



= E((l -p + p^) 

= (1 - /3)E((1 - p + p^) Dl ) + /9E((1 - p + p^ 2 ) 

> (l-/3)(l-p + p^) /£ + /3(l-p + pz)' c+1 

= fD"(l~P + pz). 



q > /d"(! -p + p^") = <?' 




1— p+pz 



(3) 



We note that 



d 2 /( a ) - f\s) 



ds 2 



oo 



[(^ PP - 1 = m)s m - fc+1 ) -(!-«) + as] 



m=0 



oo 



P P - 1 = m)(m — k + l)(m - k)s 



m—k—1 



m=0 
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Since m — k + 1 and m — k cannot have opposite signs, all summands are 
non-negative. Now if for some s £ (0, 1), it holds that f(s) — f'(s) = 
and 4-({f{s) - P(s))s~ k+1 ) > 0, then it is impossible that 



which leads to a contradiction. So ft(s) > f(s) on s £ [z, 1] and by flS} 
the statement of the theorem follows for z < z" . 



An additional question to consider for the thinned configuration model is 
the following. Let B c be the class of all p £ (0, 1] and distributions D with 
K[D] = c/p. For which p and distribution of D in B c is the (in probability 
as n — > oo) limit of n~ 1 \C 1 (n)\ maximized? Note that in these models the 
expected number of edges in the thinned graph is kept constant. 

If c> 2, then p = 1 and F(D = [c\ + 1) = 1 - P(D = [c\ ) = c - [cj give 
<Zr>,p = 0' so the (in probability as n — > oo) limit of n _1 |C 1 (n)| is maximized. 

For c < 2, we first consider the asymptotic branching process and mini- 
mize qo, P over B c . The following heuristic argument gives that qo, P is max- 
imized if p = 1 and ¥(D = 0) + ¥(D = 2) = 1. Assume that either p — 1 
or W(D = 0) + F(D = 2) = 1 does not hold. Then n~ 1 \C 1 (n)\ converges in 
probability to 1 — qD,p- The limit of n~ 1 \C 1 (n)\ is at most c/2, because the 
number of edges in the thinned graph is roughly cn/2 and the number of ver- 
tices in the giant component of a graph is at most 1 higher than the number 
of edges. Note that for p = 1 and F(D = 0) + F(D = 2) = 1, q D , p = 1 - c/2. 
However, if p = 1 and P(D = 0) + P(D = 2) = 1, n-^C^n)! does not con- 
verge to c/2 in probability. Still, the (in probability) limit of n~ 1 \C 1 (n)\ can 
be made arbitrary close to c/2 by chosing e > arbitrary small and taking 
P(£) = 2) = c/2 - 3e, F(D = 3) = 2e and P(D = 0) = 1 - c/2 + e. 

Formally we can minimze qr> )P by applying the change of variables s = 
1 — pt to the right-hand-side of (jSJ), to obtain 



Assume that zd iP < 1, otherwise there will be no giant component anyway. 
Furthermore, assume that P(Z) = 1) < 1. We note that /d(1 — pt) = 1 — t 
has roots in t = and t = 2d iP . By convexity of /d(1 — pt) in t, those are 
the only roots and /d(1 — pt) < 1 — t, for < t < zd, p - This implies that 



(/(l) - /*(!))* 



□ 





(1 - < c 
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So, if c < 2, 1 — q D:P < c/2. On the other hand, the (in probability) limit of 
n _1 |C 1 (n)| can be taken arbitrary close to c/2 by taking p — 1 and F(D = 
2) = c/2 - 3e, P(D = 3) = 2e and P(D = 0) = 1 - c/2 + e for arbitrary small 
e > 0. This answers the question posed. 
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